Item response theory: applications of modern test theory in medical education.

Author

  • Steven M Downing
Abstract

CONTEXT Item response theory (IRT) measurement models are discussed in the context of their potential usefulness in various medical education settings such as assessment of achievement and evaluation of clinical performance.

PURPOSE The purpose of this article is to compare and contrast IRT measurement with the more familiar classical measurement theory (CMT) and to explore the benefits of IRT applications in typical medical education settings.

SUMMARY CMT, the more common measurement model used in medical education, is straightforward and intuitive. Its limitation is that it is sample-dependent, in that all statistics are confounded with the particular sample of examinees who completed the assessment. Examinee scores from IRT are independent of the particular sample of test questions or assessment stimuli. Also, item characteristics, such as item difficulty, are independent of the particular sample of examinees. The IRT characteristic of invariance permits easy equating of examination scores, which places scores on a constant measurement scale and permits the legitimate comparison of student ability change over time. Three common IRT models and their statistical assumptions are discussed. IRT applications in computer-adaptive testing and as a method useful for adjusting rater error in clinical performance assessments are overviewed.

CONCLUSIONS IRT measurement is a powerful tool used to solve a major problem of CMT, that is, the confounding of examinee ability with item characteristics. IRT measurement addresses important issues in medical education, such as eliminating rater error from performance assessments.
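The contrast between sample-dependent CMT statistics and invariant IRT item parameters can be illustrated with a small sketch. The abstract does not name the three IRT models, so this example assumes they are the usual one-, two-, and three-parameter logistic models; the function names, the ability values, and the item parameters below are all hypothetical and chosen only for illustration.

```python
import math


def irt_probability(theta, b, a=1.0, c=0.0):
    """Probability of a correct response under a logistic IRT model.

    theta: examinee ability
    b: item difficulty
    a: item discrimination (fixed at 1.0 for a one-parameter model)
    c: pseudo-guessing lower asymptote (0.0 for one- and two-parameter models)
    """
    return c + (1.0 - c) / (1.0 + math.exp(-a * (theta - b)))


def expected_proportion_correct(abilities, b, a=1.0, c=0.0):
    """Classical-style item difficulty: expected proportion correct in a sample."""
    return sum(irt_probability(t, b, a, c) for t in abilities) / len(abilities)


# Two hypothetical examinee samples answering the same item (b = 0.0).
weaker_sample = [-2.0, -1.5, -1.0, -0.5, 0.0]
stronger_sample = [0.0, 0.5, 1.0, 1.5, 2.0]
item_difficulty = 0.0

# The classical proportion correct changes with the sample,
# while the IRT difficulty parameter b is the same in both cases.
print("weaker sample:", expected_proportion_correct(weaker_sample, item_difficulty))
print("stronger sample:", expected_proportion_correct(stronger_sample, item_difficulty))
```

In this sketch the same item looks "hard" in the weaker sample and "easy" in the stronger one when judged by proportion correct, which is the sample dependence the abstract attributes to CMT. The item parameter `b` stays fixed on the ability scale.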


Similar resources

Determination of the Parameters of Six Multiple Choice Tests of Mashhad University of Medical Sciences (1389-90) based on Item-Response Theory (IRT)

Background: With the industrialization of countries and the development of societies, tests and methods are required to employ people in industries and organizations and to make the best selection of the workforce. Interviews, written tests, and multiple choice tests are common methods used in employing people. Among these methods, multiple choice tests are the easiest because of th...


Psychometric Properties of the Brief Form of Professor-Students Rapport Scale-based on Classical Test Theory and Item-Response Theory

Introduction: In order to improve the quality of the teaching process, it is necessary to review the professor-student rapport. The purpose of the present study was to investigate the factor structure and item-response parameters of Professor-Students Rapport Scale-Brief (PSRS-B). Methods: In a descriptive-correlation study, 497 students from Shahid Beheshti University of Medical Sciences were ...


Psychometric Properties of State Level Subjective Vitality Scale based on classical test theory and Item-response theory

The purpose of the present study was to investigate the factor structure and item-response parameters of the State Level of Subjective Vitality Scale. The research design was correlational, and the statistical population consisted of students of the Shahid Beheshti University of Tehran. A sample of 240 students was selected through multi-stage sampling and completed Subjective Vitality ...


Selection the best Method of Equating Using Anchor-Test Design in Item Response Theory

Explaining the problem. The equating process is used to compare the scores of two different tests with the same theme. The goal of this research is to find the best method of equating data using the logistic model. Method. We use the data of the Ph.D. test in the Statistics major for two consecutive years, 92 and 93. For the analysis, we specifically use the tests of the Statistics major ...


Psychometric Properties of the Adolescent Depression Scale Based on Item-Response Theory and Comparison of the Results with Classical Test Theory

Background and Aim: The objective of this study was to assess the psychometric properties of the Adolescent Depression Scale (ADS) based on the item-response theory and compare the results with those based on the classic test theory. Materials and Methods: A total of 750 students (364 males and 386 females) were selected through multistage random clustering (levels proportional to size) and ...



Journal title:
  • Medical education

Volume 37, Issue 8

Pages: -

Publication date: 2003